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Abstract 

We are interested in modelling Darwinian evolution, resulting from the interplay of 
phenotypic variation and natural selection through ecological interactions. Our mod- 
els are rooted in the microscopic, stochastic description of a population of discrete 
individuals characterized by one or several adaptive traits. The population is mod- 
elled as a stochastic point process whose generator captures the probabilistic dynamics 
over continuous time of birth, mutation, and death, as influenced by each individual's 
trait values, and interactions between individuals. An offspring usually inherits the 
trait values of her progenitor, except when a mutation causes the offspring to take an 
instantaneous mutation step at birth to new trait values. We look for tractable large 
population approximations. By combining various scalings on population size, birth 
and death rates, mutation rate, mutation step, or time, a single microscopic model 
is shown to lead to contrasting macroscopic limits, of different nature: deterministic, 
in the form of ordinary, integro-, or partial differential equations, or probabiHstic, 
Hke stochastic partial differential equations or superprocesses. In the Hmit of rare 
mutations, we show that a possible approximation is a jump process, justifying rigor- 
ously the so-called trait substitution sequence. We thus unify different points of view 
concerning mutation-selection evolutionary models. 

Key-words: Darwinian evolution, birth-death-mutation-competition point process, mutation- 
selection dynamics, nonlinear integro-differential equations, nonlinear partial differential 
equations, nonlinear superprocesses, fitness, adaptive dynamics. 



1 Introduction 

In this paper, we are interested in modelling the dynamics of populations as driven by 
the interplay of phenotypic variation and natural selection operating through ecological 
interactions, i.e. Darwinian evolution. The fundamental property of evolving systems is 
the propensity of each individual to create and to select the diversity. This feature requires 
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to focus on the stochastic dynamics of each individual in the population. The study of 
such evolutionary-ecological models is very complicated, and several approximations have 
been proposed. Firstly, Bolker and Pacala and Dieckmann and Law [Hj have introduced 
the moment equations of the distribution of traits in the population and studied differ- 
ent moment closure heuristics. Secondly, various nonlinear macroscopic models (integro- 
differential equations, partial differential equations, superprocesses) have been proposed 
without microscopic justification. Finally, the emerging field of adaptive dynamics have 
proposed a new class of macroscopic models on the evolutionary time scale, defined as 
jump processes and ordinary differential equations (trait substitution sequences, Metz et 
al. Pni) canonical equation of adaptive dynamics, Dieckmann and Law 0)- In all these 
cases and from a biological point of view, the pathway from microscopic to macroscopic 
models deserves a firm mathematical pavement, at least to clarify the significance of the 
implicit biological assumptions underlying the choice of a particular model. 

In this work, we unify several macroscopic approximations by recovering them from a 
single microscopic model. In particular, we point out the importance of large population 
assumptions and that the nature of the approximation strongly depends on the combination 
of various scalings of the biological parameters (birth and death rates, mutation rate, 
mutation step and time). 

This paper starts (SectionEJ with the microscopic description of a population of discrete 
individuals, whose phenotypes are described by a vector of trait values. The population 
is modelled as a stochastic Markov point process whose generator captures the probabilis- 
tic dynamics over continuous time of birth, mutation and death, as influenced by each 
individual's trait values and interactions between individuals. The adaptive nature of a 
trait implies that an offspring usually inherits the trait values of her progenitor, except 
when a mutation occurs. In this case, the offspring makes an instantaneous mutation step 
at birth to new trait values. We will refer to the state space parameterized by adaptive 
traits as the trait space, and will often (slightly abusively) call trait the actual trait value. 
This process is defined as the solution of a stochastic differential equation driven by point 
Poisson measures fSection 12. ip . In Section [T2l we give an algorithmic construction of 
the population point process and propose some simulations, for various parameters, of an 
asymmetrical example developed in Kisdi jTH]- Next, we prove that the point population 
process is a measure- valued semimartingale and compute its characteristics (Section ESI- 
Then we look for tractable approximations, following different mathematical paths. Our 
first approach (Sectional) aims at deriving deterministic equations to describe the moments 
of trajectories of the point process, i.e. the statistics of a large number of independent real- 
izations of the process. We explain the difficult hierarchy between these equations coming 
from competition kernels and preventing, even in the simple mean-field case, decorrelations 
and tractable moment closure. The alternative approach involves renormalizations of the 
point process based on a large population limit. The measure- valued martingale properties 
of the renormalized point process allow us to show that, according to different scalings of 
birth, death and mutation rates, one obtains qualitatively different limiting partial differ- 
ential equations and the appearance or not of some demographic stochasticity. We show 
in Section 14.11 that by itself, the large-population limit leads to a deterministic, nonlin- 
ear integro-differential equation. Then, in Section 14.2.11 we combine the large-population 
limit with an acceleration of birth (hence mutation) and death according to small mu- 
tation steps. That yields either a deterministic nonlinear reaction-diffusion model, or a 
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stochastic measure-valued process (depending on the acceleration rate of the birth-and- 
death process). If now this acceleration of birth and death is combined with a limit of 
rare mutations, the large-population limit yields a nonlinear integro-differential equation 
either deterministic or stochastic, depending here again on the speed of the scaling of the 
birth-and-death process, as described in Section 14.2.21 

In Section El we model a time scale separation between ecological events (fast births 
and deaths) and evolution (rare mutations), for an initially monomorphic population. The 
competition between individuals takes place on the short time scale. In a large population 
limit, this leads on the mutation time scale to a jump process over the trait space, where 
the population stays monomorphic at any time. Thereby we provide a rigorous justification 
to the notion of trait substitution sequence introduced by Metz et al. |18| . 

2 Population point process 

Even if the evolution manifests itself as a global change in the state of a population, its basic 
mechanisms, mutation and selection, operate at the level of individuals. Consequently, we 
model the evolving population as a stochastic interacting individual system, where each 
individual is characterized by a vector of phenotypic trait values. The trait space X is 
assumed to be a closed subset of W^, for some d> 1. 

We will denote by Mf(X) the set of finite non-negative measures on X. Let also Ai 
be the subset of Mp{X) consisting of all finite point measures: 

M = " ^ 0,xi,...,Xn G ^1 . 

Here and below, 6x denotes the Dirac mass at x. For any m G Mf{X), any measurable 
function / on X, we set {m, f) = fdm. 

We aim to study the stochastic process ut, taking its values in Ai, and describing the 
distribution of individuals and traits at time t. We define 

i{t) 

^t = Y.^xi. (2-1) 

i=l 

I{t) G N standing for the number of individuals alive at time t, and X}^ x/^*^ describing 
the individual's traits (in X). 

For a population u = "^l^i S^i, and a trait x G A", we define the birth rate b{x, V * 
z/(x)) = b{x, "^i^i V{x — x^)) and the death rate d{x, U * I'ix)) = d(x, Y2i=i ~ ^*)) 
of individuals with trait x; V and U denote the interaction kernels affecting respectively 
reproduction and mortality, let fj,{x) and M(x,z)dz be respectively the probability that 
an offspring produced by an individual with trait x carries a mutated trait and the law of 
this mutant trait. 

Thus, the population evolution can be roughly summarized as follows. The initial 
population is characterized by a (possibly random) counting measure uq £ M. at time 
0, and any individual with trait x at time t has two independent random exponentially 
distributed "clocks": a birth clock with parameter b{x,V * ft(x)), and a death clock with 
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parameter d{x, U * ft(x)). If the death clock of an individual rings, this individual dies and 
disappears. If the birth clock of an individual with trait x rings, this individual produces an 
offspring. With probability 1 — n{x) the offspring carries the same trait x; with probability 
n{x) the trait is mutated. If a mutation occurs, the mutated offspring instantly acquires a 
new trait z, picked randomly according to the mutation step measure M{x,z)dz. 

Thus we are looking for a A^-valued Markov process {i't)t>o with infinitesimal generator 
L, defined for real bounded functions (p by 

L<p{u) = h{x\ V * u{x')){l - iJi{x^)mu + 5,.)- 0(1/)) 

i=l 

+ S2h{x\V *v{x'))n{x') [ + -(p{u))M{x\z)dz 
i=i ■'^ 
I 

+ d{x\ U * u{x')mu - ,5,0 - H'^))- (2.2) 

i=l 

The first term of H2.2p captures the effect on the population of birth without mutation; the 
second term that of birth with mutation, and the last term that of death. The density- 
dependence makes all terms nonlinear. 



2.1 Process construction 

Let us justify the existence of a Markov process admitting L as infinitesimal generator. 
The explicit construction of {i't)t>o also yields three side benefits: providing a rigorous and 
efficient algorithm for numerical simulations (given hereafter), laying the mathematical 
basis to derive the moment equations of the process (Section |3I), and establishing a general 
method that will be used to derive some large population limits (Sections ^ and EJ. 

We make the biologically natural assumption that the trait dependency of birth pa- 
rameters is "bounded", and at most linear for the death rate. Specifically, we assume 

Assumptions (H): 

There exist constants b, d, f7, V and C and a probability density function M on 
such that for each u = J2i=i and for x, z G A", 

6(x, V * v{x)) < b, d{x, U * iy{x)) < d{l + /), 
U{x) < U, V{x) < V, 
M{x,z) < CM{z - x). 

These assumptions ensure that there exists a constant C, such that the total event rate, 
for a population counting measure u = Yli=i ^x^^ obtained as the sum of all event rates, is 
bounded by CI{1 + /) . 

Let us now give a pathwise description of the population process {i't)t>o- We introduce 
the following notation. 

Notation 2.1 Let W = N\{0}. Let H = {H\ H'' , ...) : M ^ {W^)^' he defined by 
(t(i)) 2;o-(n) ) 0, 0, ...), where Xo-(i) ■■■ ^ a;<j{n)7 for some arbitrary 
order ^ on M'^ f for example the lexicographic order). 
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This function H allows us to overcome the following (purely notational) problem. Choosing 
a trait uniformly among all traits in a population v £ M consists in choosing i uniformly 
in {1, {v, 1)}, and then in choosing the individual number i (from the arbitrary order 
point of view). The trait value of such an individual is thus H'^{u). 

We now introduce the probabilistic objects we will need. 

Definition 2.2 Let (0,jr, P) be a (sufficiently large) probability space. On this space, we 
consider the following four independent random elements: 

(i) a Ai -valued random variable vq (the initial distribution), 

(ii) independent Poisson point measures Mi{ds, di, dO), and M^{ds, di, dO) on [0, cxd) x N* x 

M"*", with the same intensity measure ds ^X]fc>i^fc(^0) (^^^ "clonal" birth and 
the death Poisson measures), 

(iii) a Poisson point measure M2{ds,di,dz,d9) on [0, oo) x N* x A" x with intensity 

measure ds iJ2k>i^k{di)) dzd9 (the mutation Poisson measure). 



Let us denote by {J^t)t>o the canonical filtration generated by these processes. 

We finally define the population process in terms of these stochastic objects. 

Definition 2.3 Assume (H). A {J- t)t>o- adapted stochastic process v = {i't)t>o is called a 
population process if a.s., for all t > 0, 



t't - z^o + / ^m{u,.)^i<(us-,i)} 

'[0,t]xN*xR+ 

'^{e<b{Hi{us-),v*us^{HHus^))){i-tiiH'{iys-)))}^iids, di, dO) 

'[0,t]xN*xA'xR+ 

'^{e<b{W(vs-),V*Vs-(Hi{vs-)))iJL(Hi(v,-))M(Hi(vs-),z)}M2{ds, di, dz,d9) 
[0,t]xN*xM+ 

Let us now show that if v solves H2.,Sp , then v follows the Markovian dynamics we are 
interested in. 

Proposition 2.4 Assume (H) and consider a solution {i^t)t>o of Eq. \2. such that 
E{suY>t>T{i'tA?) < +00, VT > 0. Then {ut )t>o is a Markov process. Its infinitesimal 
generator L is defined for all bounded and measurable maps (f) : M ^ M, all v £ M, 
by \2.^) . In particular, the law of {vt)t>o does not depend on the chosen order ^. 

Proof The fact that {i't)t>o is a Markov process is classical. Let us now consider 
a function (f) as in the statement. With our notation, vq = "^^=1^ ^H'-iuo)- A simple 
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computation, using the fact that a.s., (pii^t) = (Pi'^o) + Ss<t(</'(^s- + (^s ~ ^s-)) ~ 4>i^s-)), 
shows that 



J[0,t]xN*xIR+ 



'^{e<b{Hi{us-),v*iys-{H^{iys-))){i-KHi(u,^)))}Mi{ds,di,d6) 

+ / {<Pi^s- + Sz) - (Pil^s-)) ^{i<(u,_,l)} 

J[0,t]xN*xA'xR+ 

'^{e<b{Hi{us-),V*iys-{H'{iys-)))fJ.{Hi{us-))M{Hi{us-),z)}M2{ds,di,dz, d9) 

+ i i'Pi'^s- - 5h^{us-)) - 'A(j^s-)) l{i<(i^s-,l>} 

J[0,t]xN*xR+ 

'^{e<d(Hi{v,.),u*v,.{Hi{v,.)))}^^{ds, di, dO). 
Taking expectations, we obtain 
E{4>{ut)) = E{4>{uo)) 

+ / {ct>{vs + 5.) - <l){v,)) h{H\vs), V * Vs{H\vs)))^i{H'\vs))M{H\vs). z)dz 
Jx 

+ {ct>{T^s - ^Hriu.)) - <i>{T^s)) d{H\us), U * i^siH\i^s)))])ds 



Differentiating this expression at t = leads to H2.2p . □ 
Let us show existence and moment properties for the population process. 

Theorem 2.5 (i) Assume (H) and that E {{i^o, 1)) < cxd. Then the process (ft)i>o defined 
by DeHniti,on \2.^ is well defined on M+. 

(ii) If furthermore for some p > I, E ((fo, 1)^) < cxd, then for any T < oo, 

E{ sup {ut,l)P) < oo. (2.4) 
te[o,T] 

Proof We first prove (ii). Consider the process {i't)t>o- We introduce for each n the stop- 
ping time T„ = inf {t > 0, {vt, 1) > n}. Then a simple computation using Assumption (H) 
shows that, neglecting the non-positive death terms, 

sup {l^s, If < Wo, If + / {{{'^s-, 1) + 1)^ - Ws-, If) M^<Ws-,l)} 

s€[0,tATn] J[0,tAT„]xN*xR+ 



J[0,t]xN*xA'xM+ 

^{e<biHi(us-)y*Us-iHiius-)))fi(H^(i^s-))M(Hi(iys-),z)}M2{ds,di,dz, dO). 



Using the inequality {l + x^ — x^ < Cp{l + xP ^) and taking expectations, we thus obtain, 
the value of Cp changing from line to line, 

E{ sup {us,1)'')<Cp(i + e( r^"b{{i^s-A) + {j^s-Af)ds)) 

sg[0,tAT„] V \Jo J J 

<Gp{\-rE[j^ (l + (z.,Ar„,inds^) . 

The Gronwall Lemma allows us to conclude that for any T < oo, there exists a constant 
Cp^T-, not depending on n, such that 

E{ sup (z^t, 1)^) < Cp,T. (2.5) 

tG[0,TAT„] 

First, we deduce that r„ tends a.s. to infinity. Indeed, if not, one may find a Tq < oo such 
that ETo = P (sup„ T„ < To) > 0. This would imply that E (^sup^gp^ToAr^] 1)^) > ^T^n^ 
for all n, which contradicts (|2.5p . We may let n go to infinity in (|2.5|1 thanks to the Fatou 
Lemma. This leads to (|2.4|) . 

Point (i) is a consequence of point (ii). Indeed, one builds the solution {i't)t>o step by 
step. One only has to check that the sequence of jump instants T„ goes a.s. to infinity as 
n tends to infinity. But this follows from H2.4p with p = 1. □ 



2.2 Examples and simulations 

Let us remark that Assumption (H) is satisfied in the case where 

b{x,V*iy{x)) = b{x), d{x,U * v{x)) = d{x) + a{x) i U{x-y)u{dy), (2.6) 

Jx 

where 6, d and a are bounded functions. 

In the case where moreover, = 1, this individual-based model can also be inter- 
preted as a model of "spatially structured population", where the trait is viewed as a 
spatial location and the mutation at each birth event is viewed as dispersal. This kind 
of models have been introduced by Bolker and Pacala ((2 EI) and Law et al. (^Sl); and 
mathematically studied by Fournier and Meleard |T2]. The case U = 1 corresponds to a 
density-dependence in the total population size. 

We will consider later the particular set of parameters for the logistic interaction model, 
taken from Kisdi |15| and corresponding to a model of asymmetrical competition: 

;P = [0,4], d{x)=0, a{x) = l, ^i{x) = fi, 

b{x)=A-x, Uix-y) = ^(l-——: ^-^ ^) (2.7) 

K\ 1 + l,2exp(-4(x - ?/))y 

and M{x, z)dz is a Gaussian law with mean x and variance conditionned to the fact that 
the mutant stays in [0, 4]. As we will see in Sectional the constant K scaling the strength of 
competition also scales the population size (when the initial population size is proportional 
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to K). In this model, the trait x can be interpreted as body size. Equation (j2.7|l means 
that body size influences the birth rate negatively, and creates asymmetrical competition 
reflected in the sigmoid shape of U (being larger is competitively advantageous). 

Let us give an algorithmic construction for the population process (in the general case), 
simulating the size I{t) of the population, and the trait vector Xj of all individuals alive 
at time t. 

At time t = 0, the initial population z/q contains 1(0) individuals and the corresponding 
trait vector is Xq = (XQ)i<j</(o)- We introduce the following sequences of independent 
random variables, which will drive the algorithm. 

• The type of birth or death events will be selected according to the values of a sequence 
of random variables {Wk)k&n* with uniform law on [0, 1]. 

• The times at which events may be realized will be described using a sequence of 
random variables (Tfc)fcgpj with exponential law with parameter C. 

• The mutation steps will be driven by a sequence of random variables (Zfe)fcgN with 
law M{z)dz. 

We set To = and construct the process inductively for fc > 1 as follows. 

At step k — 1, the number of individuals is /fc_i, and the trait vector of these individuals 
is Xr,_i. 

Let Tfc = Tfc_i + -. Notice that represents the time be- 

-'fc-il-'fc-i + 1) _ -'fc-i(-'fc-i + 1) 

tween jumps for Ik-i individuals, and C{Ik-i + 1) gives an upper bound on the total event 
rate for each individual. 

At time T^, one chooses an individual ik = i uniformly at random among the Ik-i alive 
in the time interval [T/c_i, T^); its trait is -^^t^.i • (^^ ^k-i = then t'f = for all t > Tk~i ) 

• lfO<Wk< — ^^77 = W^iXXt,_i), then the chosen in- 

dividual dies, and Ik = Ik-i — 1- 

. If WI{Xt,_,) <Wk< VF2(Xt,_J, where 



C{Ik-i + 1) 



then the chosen individual gives birth to an offspring with trait X^ , and Ik = 
Ik-i + 1. 

If Wi{XT,_,) <Wk< Wi{XT,_,,Zk), where 
Wl{XT,_„Zk) = Wi{XT,^,)+ 

CM{Zk){Ik-i + l) 

then the chosen individual gives birth to a mutant offspring with trait X^^__^ + Zk, 
and Ik = 4-1 + 1- 
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• UWk> W|(Xt^_j, Zfc), nothing happens, and h = 4-i- 

Then, at any time t > 0, the number of individuals is defined by I{t) = X]fc>o ^{Tk<t<Tk+i}^k 
and the population process is obtained as ut = X^fc>o ^{Tk<t<Tk+i} Yliti ^xi^ ■ 

The simulation of Kisdi's example (|2.7p can be carried out following this algorithm. 
We can show a very wide variety of qualitative behavior according to the value of the 
parameters cr, /i and K. 

In the following figures, the upper part gives the distribution of the traits in the pop- 
ulation at any time, using a grey scale code for the number of individuals holding a given 
trait. The lower part of the simulation represents the dynamics of the total size I{t) of the 
population. 

These simulations will serve to illustrate the different mathematical scalings described 
in Sections 0] and 13 Let us observe for the moment the qualitative differences between the 
cases where K is large (Fig. ^(c)), in which a wide population density evolves regularly 
(see Section l¥?T|l and where fi is small (Fig. ^(d)), in which the population trait evolves 
according to a jump process (see Section I^^. 

The simulations of Fig. |21 involve an acceleration of the birth and death processes (see 
Section 1121) as 

b{xX) = K"^ + b{x) and d{x,C) = K"^ + d(x) + a{x)C. 

There is a noticeable qualitative difference between Fig. |21 (a) and (b), where t] = 1/2, 
and Fig. El (c) and (d), where rj = 1. In the latter, we observe strong fluctuations in the 
population size and a finely branched structure of the evolutionnary pattern, revealing a 
new form of stochasticity in the large population approximation. 

2.3 Martingale Properties 

We finally give some martingale properties of the process {yt)t>o, which are the key point 
of our approach. 

Theorem 2.6 Assume (H), and that for some p>2, E{{uq^ 1)^) < oo. 

(i) For all measurable functions (j) from M. into M such that for some constant C , for all 

ueM, + \L(l){iy)\ < C(l + {u, If), the process 

(l){iyt) - (t>{iyo) - [ L(P{vs)ds (2.8) 

JO 

is a cadlag {J-t)t>o-martingale starting from 0. 

(ii) Point (i) applies to any function 4>{v) = {f, fY, with < g < p — 1 and with f hounded 

and measurable on X . 
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(a) At = 0.03, K = 100, a = 0.1. 



(b) n = 0.03, K = 3000, a = 0.1. 
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(c) /i = 0.03, K = 100000, a = 0.1. 



(d) ^i = 0.00001, K = 3000, (7 = 0.1. 



Figure 1: Numerical simulations of trait distributions (upper panels, darker is higher fre- 
quency) and population size (lower panels). The initial population is monomorphic with 
trait value 1.2 and contains K individuals, (a-c) Qualitative effect of increasing system size 
(measured by parameter K). (d) Large sysjgm size and very small mutation probability 





(c) II = 0.3, K = 10000, (7 = 0.3/A''"'2, = (d) /i = 0.3, A" = 10000, o = O.S/J^'"'^ r; = 

1. 1. 

Figure 2: Numerical simulations of trait distribution (upper panels, darker is higher fre- 
quency) and population size (lower panels) for accelerated birth and death and concur- 
rently increased system size. Parameter r] -^^etween and 1) relates the acceleration of 
demographic turnover and the increase of system size, (a) Rescaling mutation step, (b) 
Rescaling mutation probability, (c-d) Rescaling mutation step in the limit case = 1; two 
samples for the same population. The initial population is monomorphic with trait value 



(iii) For such a function f , the process 

Ml = {uu f) - f) - f I I ( (1 - V * us{x)) - d{x, U * Mx)) ] fix) 



Jo JX 

+ n{x)b{x,V *Vs{x)) j f{z)M{x,z)dz^Vs{dx)ds (2.9) 
is a cadlag square integrahle martingale starting from with quadratic variation 
{Mf)t = ^* ^ I (^(1 - ^i{x))h{x, V * us{x)) - d{x, U * u,{x))j f\x) 

+ fi{x)b{x,V *jys{x)) / f'^{z)M{x,z)dz\us{dx)ds. (2.10) 



X 



Proof First of all, note that point (i) is immediate thanks to Proposition 12.41 and H2.4p . 
Point (ii) follows from a straightforward computation using (|2.2|1 . To prove (iii), we first 

assume that E (^{i^o, 1)^^ < oo. We apply (i) with 4>{i') = {v, /). This yields that is a 
martingale. To compute its bracket, we first apply (i) with (f){v) = (i^, /)^ and obtain that 



t 

2/ 



{yufr-{yoJY- J^^[il-fiix))bix,V*i^six)){f\x) + 2fix){i.sJ)) 

+ d{x, U * Vs{x))U\x) - 2f{x) {Vs, /)) 

+ ii{x)b{x, V * lysix)) jj fi^) + 2/(^) Ws, f))M{x, z)dz^i^sidx)ds (2.11) 

is a martingale. In another hand, we apply the Ito formula to compute {vt, /)^ from H2.9p . 
We deduce that 

{ut, ff - {uo, ff - 1^ 2 {Us, /) ^ I ((1 - /i(x))6(x, V * us{x)) - d{x, U * Us{x))^ f{x) 
+ n{x)b{x,V *Us{x)) j f{z)M{x,z)dz^Us{dx)ds- {Mf)t (2.12) 

is a martingale. Comparing (|2.1HI and H2.12|l leads to H2.1()|l . The extension to the case 
where only E ^(fO;l)'^^ < cxd is straightforward, since even in this case, < oo 

thanks to with p = 2. □ 



3 Moment equations 

Moment equations have been proposed by Bolker and Pacala (0[3) and Dieckmann and 
Law ([Hj) as handy analytical models for spatially structured populations. 

The philosophy of moment equations is germane to the principle of Monte-Carlo meth- 
ods: computing the mean path of the point process from a large number of independent 
realizations. (Another approach, as we shall see in Section ^ is to model the behavior of 
a single trajectory when it is the initial number of individuals which is made large). 
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Let us define the deterministic measure E{i') associated with a random measure i' 

by / Lp{x)E{u){dx) = E{ I if{x)v{dx)). Taking expectations in H2.9p . we obtain some 

J X J X 

formula for J-^ ip{x)E(h'){dx) involving the expectations of integrals with respect to ^{dx) 

or to v{dx)u{dy) . Nevertheless, this equation is very intricate and presents an unresolved 

hierarchy of nonlinearities. Writing an equation for E{v{dx)v{dy)) could be possible but 

will involve integrals with respect to u{dx)ij{dy)i'{dz) and so on. Whether this approach 

may eventually help describe the population dynamics in the trait space is still unclear. 

Let us consider the case of spatially structured population (see Section I2.2|l where 

d{x,C) = d{x) +a{x)Q, b{x,C) = H^) and fi{x) = 1. Let N{t) = E{I{t)) where I{t) is the 

number of individuals at time t. Taking expectations on H2.9p with (/? = 1 yields: 



N{t) = N{0) + J^ E (^j 



{b{x) — d{x))vs{dx) — / a{x)U{x — y)vs{dx)vs{dy) \ ds. (3.1) 
X JxxX ) 

In the specific case where d and a are independent of (the spatial location) x, (cf. [16j), 
H3.1|l recasts into 



(3.2) 



N = {b - d)N - aE ( [ Uix - y)vt{dx)vt{dy) \ . 

\JxxX J 

Even in the specific mean-field case where ?7 = 1 , we get 

N ={b-d)N -aEy f vt{dx)vt{dy)] . (3.3) 



XxX 

The quadratic term corresponding to spatial correlations can not be simplified and H3.3|l 
allows us to precisely identify the mathematical issues raised by the problem of moment 
closure. In Section l¥?T| we will see that one needs the additional large population hypothesis 
to decorrelate the quadratic term and to refind the well-known logistic equation. 

Nevertheless, even if we are not able to produce a closed equation satisfied by E{i'), 
we are able to show, in the general case, the following qualitative important property 
concerning the absolute continuity of the expectation of Vf 

Proposition 3.1 Assume (H), that E{{vq^1)) < oo and that E{i>q) is absolutely continu- 
ous with respect to the Lebesgue measure. Then for all t > 0, E{yt) is absolutely continuous 
with respect to the Lebesgue measure. 

Remark 3.2 This implies in particular that, when the initial trait distribution E{vq) has 
no singidarity w.r.t. the Lebesgue measure, these singularities, such as Dirac masses, can 
only appear in the limit of infinite time. 

Proof Consider a Borel set A ofW^ with Lebesgue measure zero. Consider also, for each 
n > 1, the stopping time r„ = inf {t > 0, {i^t, 1) > n}. A simple computation allows us to 
obtain, for all t > 0, all n > 1, 

/ rtATn r > 

E{{ut^r„,lA)) <E{{uo,lA))+b Ei / / lA{x)us{dx)ds 

\Jo Jx / 

+ h E(^j j (yj ^A{z)M{x,z)dz^ Us{dx)ds^. 
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By assumption, the first term on the RHS is zero. The third term is also zero, since for 
any x £ X, J.^ 1a{z)M{x, z)dz = 0. By Gronwall's lemma, we conclude that for each n, 
-^((^iAT„i 1a)) is zero. Thanks to H2.4p with p = 1, r„ a.s. grows to infinity with n, which 
concludes the proof. □ 



4 Large-population renormalizations of the individual-based 
process 

The moment equation approach outlined above is based on the idea of averaging a large 
number of independent realizations of the population process initiated with a finite number 
of individuals. If K scales the initial number of individuals, the alternative approach 
consists in studying the exact process by letting that system size become very large and 
making some appropriate renormalizations. Several types of approximations can then be 
derived, depending on the renormalization of the process. 

For any K, let the set of parameters Uk, Vk, bx, dx, Mk, satisfy the Assump- 
tion (H). Let f/^ be the counting measure of the population at time t. We define the 
measure-valued Markov process {X^)t>Q by 

t ^ t 

As the system size K goes to infinity, we need to assume the 

Assumption (HI): The parameters Uk, Vk, bK, dx, Mk and fiK are all continuous, 
( I— > b{x, and C i— > d{x, () are Lipschitz for any x £ X, and 

Uk{x) = U{x)/K, Vk{x) = V{x)/K. 

A biological interpretation of this renormalization is that larger systems are made up of 
smaller individuals, which may be a consequence of a fixed amount of available resources to 
be partitioned among individuals. Thus, the biomass of each interacting individual scales 
as 1/K, which may imply that the interaction effect of the global population on a focal 
individual is of order 1. Parameter K may also be interpreted as scaling the resources 
available, so that the renormalization of Uk and Vk reflects the decrease of competition 
for resources. 

The generator of {i^t^)t>o is given by (j2.2|l . with parameters Uk, Vk, bK, dK, Mk, 
IXk- The generator of {X^)t>Q is obtained by writing, for any measurable function cp 
from Mf{X) into M and any v £ Mf{X), 

where (/>'^(/i) = 4>{l^/K). Then we get 

L^</.(z.) = K [ bK{x, V * u{x)){l - fiK{x)){<P{iy + 4^<5x) - Hiy))ij{dx) 
Jx ^ 

+ K I I bK{x,V*u{x))fiK{x)i(p{u + ^6,)-(f)iv))MK{x,z)dzu{dx) 
Jx Jx ^ 

+ K I dKix,U* - - (l){iy))iy{dx). (4.1) 

Jx ^ 
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By a similar proof as the one of Section 12.31 we may summarize the moment and 
martingale properties of . 

Proposition 4.1 Assume that for some p> 2, E{{Xq- , lY) < +oo. 

(1) For any T > 0, E{supt^[o,T]{Xt^ , 1)^) < +oo. 

(2) For any bounded and measurable functions (f) on Mp such that \4>{i^)\ + iL^'t'i^)] < 

C(l+ < u,! >^), the process (k(X^) - <p{X^) - /o L^(t){Xf)ds is a cadlag mar- 
tingale. 

(3) For each measurable bounded function f , the process 

mf'^ = (Xf,/)-(Xo^,/) 

- / / {bK{x, V * Xf (x)) - dK{x, U * X^{x)))f{x)X^{dx)ds 
Jo Jx 

' lo Ix ^ * ^^^''^ ( X ^(^)^^(^' - fix)^Xfidx)ds 

is a square integrable martingale with quadratic variation 
(m^'^)t = -ij^ l^fiKix)bK{x,V*X^{x))(^j^f\z)MK{x,z)dz-f\x)^Xf{dx)ds 

^ lo I ^ * ^^^""^^ ^ ^ * ^s{x)))f'^{x)X^{dx)ds^ (4.2) 

The search of tractable limits for the semimartingales {X^ , f) yields the different choices 
of scalings of the parameters developed in this section. In particular, we obtain the deter- 
ministic or stochastic nature of the approximation by studying the quadratic variation of 
the martingale term, given in l|4.2|l . 

4.1 Large-population limit 

We assume here that bx = b, dx = d, fix = A*, Mx = M. 

Theorem 4.2 Assume Assumptions (H) and (HI). Assume moreover that the initial con- 
ditions X^ converge in law and for the weak topology on Mp{X) as K increases, to a finite 
deterministic measure ^q, and that sup;^ i?((XQ^, 1)^) < +oo. 

Then for any T > 0, the process {X^^)t>o converges in law, in the Skorohod space 
n{[0,T], Mf{X)) , as K goes to infinity, to the unique deterministic continuous function 
S C([0, T], Mir(^Y)) satisfying for any bounded / : A:" — > M 

(6,/> = (Co,/)+ r / f{x)[{l- i,{x))b{x,V*Ux))-d{{x,U*Ux))]Udx)ds 
Jo Jx 

lo Ix ^ * ^^^^^^ (X ^(^)^(''' ^)^^) Udx)ds (4.3) 
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The proof of Theorem 14.21 is let to the reader. It can be adapted from the proofs of 
Theorem I4.,SI and 14.51 below, or obtained as a generalization of Theorem 5.3 in jT2I. This 
result is illustrated by the simulations of Figs. Q (a)-(c). 

Main Examples: 

(1) A density case. Following similar arguments as in the proof of Proposition mi one 

shows that if the initial condition .^o has a density w.r.t. Lebesgue measure, then the 
same property holds for the finite measure , which is then solution of the functional 
equation: 

dtUx) = [(1 - m(x))6(x, V * Ct{x)) - d{x, U * 6(x))] 

+ / Miy,x)f,iy)biy,V *^tiy)My)dy (4.4) 

for all X G A' and t > 0. Desvillettes et al. [B] suggest to refer to as the population 
number density; then the quantity nt = Jp^^t{x)dx can be interpreted as the total 
population density over the whole trait space. 

(2) The mean field case. As for moment equations (cf. Section [31), the case of spatially 

structured populations with constant rates 6, d, a is meaningful. In this context, H4.4|l 
leads to the following equation on nt'- 

dtnt = {b- d)nt -a [ U{x - y)it{dx)^t{dy) . (4.5) 

JxxX 

With the assumption U = 1, we recover the classical mean-field logistic equation of 
population growth: 

dtnt = {h — d)nt — anf. 

Comparing H4.5p with the first-moment equation (|3.,S() obtained previously stresses 
out the "decorrelative" effect of the large system size renormalization (only in case 
U = 1). In (|,S.3|1 . the correction term capturing the effect of spatial correlations in 
the population remains, even if one assumes U = 1. 

(3) Monomorphic and dimorphic cases without mutation. We assume here that 

the population evolves without mutation (parameter /x = 0); then the population 
traits are the initial ones. 

(a) Monomorphic case: only trait x is present in the population at time t = 
0. Thus, we can write = nQ{x)6x, and then X^^ = nf-{x)5x for any time t. 
Theorem 14.21 recasts in this case into nj^{x) —>■ nt{x) with ^t = n't{x)6x, and H4.,S|1 
writes 

^nt{x) = ntix){bix,ViO)ntix)) - dix,UiO)ntix))), (4.6) 

(b) Dimorphic case: when the population contains two traits x and y, i.e. when 
X^ = nQ{x)5x + Uq {y)Sy, we can define in a similar way nt{x) and nt{y) for any 
t as before, such that (,t = nt{x)5x + nt{y)5y satisfies (|4.,S|1 . which recasts into the 
following system of coupled ordinary differential equations: 
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^nt{x) = nt{x){b{x,V{0)nt{x)+V{x-y)nt{y))-d{x, U{0)nt{x) + U{x-y)nt{y))) 

(4.7) 

—nt{y) = nt{y){b{y,V{0)nt{y) + V{y-x)nt{x))-d{y,U{0)nt{y) + U{y-x)nt{x))). 

4.2 Large-population limit with accelerated births and deaths 

We consider here an alternative limit of a large population, combined with accelerated birth 
and death. This may be useful to investigate the qualitative differences of evolutionary 
dynamics across populations with allometric demographies (larger populations made up of 
smaller individuals who reproduce and die faster). 

Here, we assume for simplicity that X = W^. Let us denote by Mp the space 
We consider the acceleration of birth and death processes at a rate proportional to 
while preserving the demographic balance. That is, the birth and death rates scale with 
system size according to 

Assumption (H2): 

bK{x, C) = K^r{x) + b{x, C), dK{x, = K^r{x) + d{x, Q. 

The allometric effect (smaller individuals reproduce and die faster) is parameterized by 
the function r, positive and bounded over X , and the constant rj. As in Section 14.11 
the interaction kernels V and U are renormalized by K. Using similar arguments as in 
Section im the process = -^v^ is now a Markov process with generator 

L^0(z/) = K [ {K^r{x) + b{x, V * z^(x)))(l - i^K(.x)){cl){iy + 1<5,.) - (^{u))u{dx) 



+ K 



{K'^r{x) + b{x, V * iy{x)))nK{x) [ (0(zv + ^S^) - (I){u))Mk{x, z)dzv{dx) 
+ K f {K'^r{x) + d{x,U*y{x))){cp{v-^5:,)-(P{v))u{dx). 

As before, for any measurable functions 4> on Mp such that | </>(;/) | + (f){i')\ < C(l + 
(z^, 1)^), the process 

<A(Xf ) - 4>{X^) - f L^4>{Xf)ds (4.8) 
Jo 

is a martingale. In particular, for each measurable bounded function /, we obtain 

Mf'^ = (Xf,/)-(Xo^,/) 

[ {b{x,V*Xf{x))-d{x,U*Xf{x)))f{x)Xf{dx)ds (4.9) 

f [ fiK{x){K^r{x) + b{x,V*Xf{x)))( [ f{z)MK{x,z)dz-f{x)]xf{dx)ds, 

JR'i \Jr<* / 
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is a square integrable martingale with quadratic variation 



(M^-/), = 1/ r [ {2K''r{x) + b{x,V*Xf{x))+d{x,U*Xf{x)))f{x)Xf{dx)ds 



I Jo Jr'^ 



+ 




Two interesting cases will be considered hereafter, in which the variance effect fixMx 
is of order That will ensure the deterministic part in H4.9p to converge. In the 

large-population renormalization f Section |4.1|1 . the quadratic variation of the martingale 
part was of the order of 1/K. Here, it is of the order of x 1/K. This quadratic 
variation will thus stay finite provided that t] G (0,1], in which case tractable limits will 
result. Moreover, this limit will be zero if 77 < 1 and nonzero if 77 = 1, which will lead to 
deterministic or random limit models. 

4.2.1 Accelerated mutation and small mutation steps 

We consider here that the mutation rate is fixed, so that mutations are accelerated as a 
consequence of accelerating birth. We assume 

Assumptions (H3): 

(1) UK = 

(2) The mutation step density Mk{x, z) is the density of a random variable with mean x, 

variance-covariance matrix T,{x)/K^ (where S(x) = (Sjj(x))i<jj<rf) and with third 
moment of order uniformly in rr (e > 0). (Thus, as K goes to infinity, mutant 

traits become more concentrated around their progenitors'). 

(3) \/S denoting the symmetrical square root matrix of S, the function y/Y^r^i is Lipschitz 



The main example is when the mutation step density is taken as the density of a vector 
of independent Gaussian variables with mean x and variance a'^{x)/K^: 



Then the convergence results of this section can be stated as follows. 

Theorem 4.3 (1) Assume (H), (HI), (H2), (H3) and < < 1. Assume also that 
the initial conditions X^ converge in law and for the weak topology on Mp as K 
increases, to a finite deterministic measure ^q, and that 



continuous. 




(4.11) 



where (T^(x) is positive and bounded over W^. 



supE((Xf ,1)3) < +00. 



(4.12) 



K 
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Then, for each T > 0, the sequence of processes {X^) belonging to D([0, T],M^) 
converges (in law) to the unique deterministic function {^t)t>o G C([0, T], M^?) sat- 
isfying: for each function f G C^{M.'^), 

{^t, f) = (Co, f) + f I {b{x, V * - d{x, U * Ux)))f{x)Udx)ds 

Jo JR'' 

+ 11 \^i{x)r{x) ^ij{x)dlf{x)Udx)ds, (4.13) 

where dfjf denotes the second-order partial derivative of f with respect to x-i and Xj 

(X = (Xl, . . .,Xd)). 

(2) Assume moreover that there exists c > such that r{x)fi{x)s*Ti{x)s > c||s|p for any 
X and s m R"'. Then for each t > 0, the measure has a density with respect to 
Lebesgue measure. 

Remark 4.4 In case (2), Eq. \4-13\l may be written as 

dtit{x) = (h{x,V*it{x))-d{x,U*Ux))\t{x) + ]^ 4(r^S,,-6)(x). (4.14) 

Observe that, for the example i4-ll\ l, this equation writes 

dtit{x)= {h{x,V*it{x))-d{x,U*it{x))^it{x) + \Hcy^ri,it){x). (4.15) 

Therefore, Eq. j^. generalizes the Fisher reaction- diffusion equation known from classi- 
cal population genetics (see e.g. /^/J. 

Theorem 4.5 Assume (H), (HI), (H2), (H3) and rj = 1. Assume also that the initial 
conditions converge in law and for the weak topology on Mf{X) as K increases, to a 
finite (possibly random) measure Xq, and that sup^ -E'((X(^, 1)'^) < +oo. 

Then, for each T > 0, the sequence of processes (X^) converges in law in D([0, T], Mp) 
to the unique (in law) continuous superprocess X £ C{[0,T], Mp) , defined by the following 
conditions: 

sup E{{Xt,lf) <oo, (4.16) 



and for any f £ C? 



te[o,T] 



Mi = {Xt, f) - (Xo, f)-\f I Kx)r{x) Y ^^J{x)^y{x)Xs{dx)ds 

•^^'^ l<i,j<d 

f{x) {b{x, V * Xs{x)) - d{x, U * X,{x))) X,{dx)ds (4.17) 
is a continuous martingale with quadratic variation 

{M^)t = 2 [ [ r{x)f\x)X,{dx)ds. (4.18) 

Jo jR'i 
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Remark 4.6 (1) The limiting measure-valued process X appears as a generalization of 
the one proposed by Etheridge to model spatially structured populations. 

(2) The conditions characterizing the process X above can be formally rewritten as equation 

dtXt{x) = (b{x,V*Xt{x))-d{x,U*Xt{x)))xt{x) + ^ Yl dfj{rfiJ:,,Xt){x) + Mt 

^<i,j<d. 

where Mt is a random fluctuation term, which reflects the demographic stochasticity 
of this fast birth- and- death process, that is, faster than the accelerated birth- and- death 
process which led to the deterministic reaction- diffusion approximation J^. j5| ). 

(3) As developed in Step 1 of the proof of Theorem \4 -SI below, a Girsanov's theorem relates 

the law of Xt and the one of a standard super- Brownian motion, which leads to 
conjecture that a density for Xt exists only when d = 1, as for the super- Brownian 
motion. 

These two theorems are illustrated by the simulations of Figs. El (a), (c) and (d). 

Proof of Theorem 14.31 We divide the proof in several steps. Let us fix T > 0. 

Step 1 Let us first show the uniqueness for a solution of the equation (|4.1Hp . 

To this aim, we define the evolution equation associated with (|4.13p . It is easy to prove 
that if ^ is a solution of H4.13|l satisfying sup^^jo.T] ('^ti 1) < oo, then for each test function 
^^(x) = ijit^x) G C^^'^(M+ X M'^), one has 

iiu^t) = {io.^o) + I I ib{x,V*Cs{x))-d{x,U*Ux)))iP{s,x%{dx)ds 

Jo JR<* 

+ / / {dMs,x) + ]-r{x)fiix)y^^ij{x)df^iPsix))Udx)ds. (4.19) 
Jo JRd 2 ^ ' 

Now, since the function -v/Sr/i is Lipschitz continuous, we may define the transition semi- 
group {Pt) whith infinitesimal generator / i— > ^rjj.'^^jT^ijdfjf. Then, for each function 
/ G C|(M'^) and fixed t > 0, to choose V(s, x) = Pt-sf {x) yields 

{iu f) = (Co, Ptf) + f I {b{x, V * Ux)) - d{x, U * Ux)))Pt-sf{x)Udx)ds, (4.20) 

Jo JK<* 

since dsip{s,x) + ^r{x)fi{x) J2i j ^ij{x)dfjTps{x) = for this choice. 
We now prove the uniqueness of a solution of (|4.20p . 

Let us consider two solutions (^t)t>o and {^t)t>o of H4.20|l satisfying sup^g[o,r] (Ct + ^t, l) = 
At < +00. We consider the variation norm defined for ni and //2 in by 

||^i-Ai2||= sup \ {fj.1 - fi2, f) \- (4.21) 

/GL°°{R<'), ||/||cx><l 
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Then, we consider some bounded and measurable function / defined on X such that 
loo < 1 and obtain 



{Ct-Lf)\< 



+ 



+ 



[Udx) - Udx)] {b{x, V * Ux)) - d{x, U * Ux))) Pt~sf{x 
UdxMx, V * Ux)) - b{x, V * Ux)))Pt-sf{x) ds 
Udx){d{x, U * Ux)) - d{x, U * Ux)))Pt-sf{x) ds. 



ds 



(4.22) 



Since ||/||oo < 1, then \ \Pt-sf \\oo < 1 and for all x E M'^, 

V * Ux)) - d{x, U * Ux)))Pt~sf{x)\ < b + d{l + UAt). 

Moreover, b and d are Lipschitz continuous in their second variable with respective con- 
stants Ki, and K^. Thus we obtain from H4.22|l that 

I {^t - 6, /) I < + d{l + UAt) + KUtV + KdArU] f \ % - U\ds. (4.23) 

Jo 



Taking the supremum over all functions / such that ||/||oo < 1, and using the Gronwall 
Lemma, we finally deduce that for all t < T, — it\ \ = 0. Uniqueness holds. 

Step 2 Next, we would like to obtain some moment estimates. First, we check that 
for all T < oo, 

sup sup E{{X^,lf) <oo. (4.24) 

K tg[0,T] 

To this end, we use (|4.8|) with (j)^^) = {v, 1)^. (To be completely rigorous, one should first 
use (t){v) = {u, 1)^ A A, make A tend to infinity). Taking expectation, we obtain that for 
all t > 0, all K, 

E{{X,^,lf)=E{{X^,lf) 

^Jo^iL ([^'^'^(^) + ^ * { ^^^^^ 1^ + - ^^^^ 1^'} 

{K^'+Mx) + Kd(x, U * Xf (x))} I [(Xf , 1) - 1]3 - (Xf , if^^Xfidx)^ ds. 

Neglecting the non-positive death term involving d, we get 

E{{X,^,lf)<E{{X^,lf) 



+ 



^ (if''+V(x) |[(Xf , 1) + + [(Xf , 1) - - 2(Xf , 1)3 



+ Kb{x,V*X^ix)){[{X^,l) + 



(xf,i)n ]x^idx)]ds. 



But for all x > 0, all e G (0, 1], {x+ef-x^ < 6e(l+x2) and \{x+ef + {x-ef-2x^\ = 6e^x. 
We finally obtain 

E ((Xf , If) < E {{X^, 1)3) +C ['e ((Xf , 1) + (Xf , 1)2 + (Xf , 1)3) d.s. 

Jo 
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Assumption (|4.12|l and the Gronwall Lemma allows us to conclude that (|4.24jl holds. 
Next, we wish to check that 

sup^( sup <oo. (4.25) 

K te[o,T] 

Applying H4.9|l with / = 1, we obtain 

(Xf , 1) = {X^, 1) + r / {Kx, V * Xf (x)) - d{x, U * Xf (x))) X^{dx)ds + mf '\ 
Jo Jx 

Hence 

sup (Xf,l)2<cf(Xo^,l)2 + 6 [\xf,lfds+ sup iMf'ipY 

sg[0,t] V JO sg[0,t] / 

Thanks to H4.12|l . the Doob inequality and the Gronwall Lemma, there exists a constant 
Ct not depending on K such that 

E{ sup (Xf , 1)2) <Q{1 + E ((M^'i)i)) . 

se[o,t] 

Using now (|4.in|l . we obtain, for some other constant Ct not depending on 

E {{M^'')t) <C [\e ((Xf , 1) + (Xf , 1)2) )ds<Ct 
Jo 

thanks to (|4.24p . This concludes the proof of H4.25|) . 

Step 3 We first endow Mp with the vague topology, the extension to the weak 
topology being handled in Step 6 below. To show the tightness of the sequence of laws 
= £(X^) in V{O{[0,T],Mf)), it suffices, following Roelly to show that for any 
continuous bounded function / on M"^, the sequence of laws of the processes {X^ , f) is tight 
in D([0, r],M). To this end, we use the Aldous criterion and the Rebolledo criterion 
(see in]). We have to show that 

sup£^( sup KXf,/)|) <oo, (4.26) 

K t£[0,T] 

and the tightness respectively of the laws of the predictable quadratic variation of the 
martingale part and of the drift part of the semimartingales (X^,/). 
Since / is bounded, (|4.26|1 is a consequence of H4.25p : let us thus consider a couple {S,S') 
of stopping times satisfying a.s. 0<S<S'<S + 5<T. Using H4.1()|l and H4.25|l . we get 
for constants C, C 

E ((M^'^)5' - (M^''/)^) < CE y^'^' ((Xf , 1) + (Xf , 1)2) ds^ < C'6. 

In a similar way, the expectation of the finite variation part of (Xf , /) — (Xf , /) is bounded 
by C'6. 

Hence, the sequence = C{X^) is tight. 
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step 4 Let us now denote by Q the limiting law of a subsequence of . We still 
denote this subsequence by . Let X = {Xt)t>Q a process with law Q. We remark that 
by construction, almost surely, 

sup sup \{Xf,f)-{X^^J)\<l/K. 

te[o,T] /eL°°(iR''),||/||oc<i 

This implies that the process X is a.s. strongly continuous. 

Step 5 The time T > is fixed. Let us now check that almost surely, the process X 
is the unique solution of (|4.1,S|I . Thanks to H4.25|l . it satisfies supjg[o,T] (^t' 1) < ^-S-, 
for each T. We fix now a function / G Cf{W^) (the extension of (j4.13p to any function / 
in C| is not hard) and some t <T. 
For u e C{[0,T],Mf), denote by 

(i^) = Wt, f) - {i^o, f) - [ [ {b{x, V * u,{x)) - d{x, U * Us{x)))f{x)iys{dx)ds, 







^t(^) = - f I ll^{x)r{x)J2^ij{x)dy{x)us{dx)ds. (4.27) 
Jo JR-i ^ 



We have to show that 

Eq{\^\{X)^^\{X)^=^. (4.28) 
By (|4.9|1 . we know that for each 

Mf = ^\{X^) + ^i'^(X^), 

where 

^-2'^(X^) = - r / /i(x)(i^V(x) + 6(x, y * (x))) 

/ !{z)MK{x,z)dz- f{x)\xf{dx)ds. (4.29) 
Moreover, H4.25jl implies that for each 

E (|Mf '/|^) = E ((Af^<'/),) < ^E {(Xf , 1) + (Xf , 1)^} d.) < 

(4.30) 

which goes to as ii' tends to infinity, since < a < 1. Therefore, 

\irnE{\^!\{X^) + ^}^{X^)\)=Q. 

Since X is a.s. strongly continuous, since / € C|(M'^) and thanks to the continuity of 
the parameters, the functions and ^'^ are a.s. continuous at X. Furthermore, for any 
1/ GD([0, T],Mf), 

\^>\{v) + ^]{u)\ < Cf,T sup (1 + {Us, if) . (4.31) 

se[o,T] 
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Hence using H4.24jl . we see that the sequence {"^j{X^) + "^^{X^))x is uniformly integrable, 
and thus 

]imE imx^) + = E {\^1{X) + . (4.32) 

We have now to deal with "^f'^ {X^) — "^^{X^). The convergence of this term is due to 
the fact that the measure Mxix, z)dz has mean x, variance T,{x)/K^, and third moment 
bounded by C / K^^^ (e > 0) uniformly in x. Indeed, if Hf{x) denotes the Hessian matrix 
of / at X, 

f{z)MK(x,z)dz 

fix) + {z-x)- V/(x) + ^iz- xYHf{x){z -x) + 0{{z - xfU Mk{x, z)dz 



= /(-) + ^E^4/(-) + «(^)- (4-33) 
where K'^o{^) tends to uniformly in x (since / is in C^), as K tends to infinity. Then, 
^t'^{X^) = - f [ ^^{x){K'^r{x)+b{x,V*Xf{x)))x 

Jo JR'i 

\Y.^dy{x) + o{^))xf{dx)ds, 



and 

\^]^^[x^) - mj{x^)\ <Cf< , i>[^ + K'o{^)^ . 

Using H4.25|l . we conclude the proof of H4.28|l . 

Step 6 The previous steps imply that {X^)k converges to ^ in D([0, T], Mp), where 
Mp is endowed with the vague topology. To extend the result to the case where Mp is 
endowed with the weak topology, we use a criterion proved in Meleard and Roelly |17| : 
since the limiting process is continuous, it suffices to prove that the sequence ((X^,l)) 
converges to (^,1) in law, in D([0, r],]R). One may of course apply Step 5 with / = 1, 
which concludes the proof. 

2) Let us now assume the non-degeneracy property r{x)^{x)s*T,{x)s > c||s|p > for 
each X G M"', s G M.'^. That implies that for each time t > 0, the transition semigroup 
Pt{x,dy) introduced in Step 1 of this proof has for each x a density function pt{x,y) with 
respect to the Lebesgue measure. Then if we come back to the evolution equation (|4.2()p . 
we can write 



/ 



f{x)it{dx)= I I f{y)pt{x,y)dy] ^o{dx) 

+ /*/ {b{x,V*Ux))-d{x,U*Ux)))[ I f{y)pt-s{x,y)dy\Udx)ds. 
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Using the fact that the parameters are bounded, that sup^<2-(^i, 1) < +00 and that 
/ is bounded, we can apply Pubini's theorem and deduce that 



fixMdx) = / Ht{y)f{y)dy 

with H G L°°([0, T], L^(M'^)), which implies that has a density with respect to the 
Lebesgue measure for each time t <T. 

Equation is then the dual form of □ 



Proof of Theorem l4.5l We will use a similar method as the one of the previous theorem. 
Steps 2, 3, 4 and 6 of this proof can be achieved exactly in the same way. Therefore, we 
only have to prove the uniqueness (in law) of the solution to the martingale problem (|4.1f)|l - 
H4.18|l (Step 1), and that any accumulation point of the sequence of laws of is solution 
to (ITlfill - lITT^ (Step 5). 

Step 1 This uniqueness result is well-known for the super-Brownian process (defined 
by a similar martingale problem, but with b = d = 0, r = ij, = l and T, = Id, cf. |2()|). 
Following (Hj, we may use the version of Dawson's Girsanov transform obtained in Evans 
and Perkins (Theorem 2.3), to deduce the uniqueness in our situation, provided the 
condition 

e( [ I [h{x,V*Xs{x))-d{x,U*Xs{x))f Xs{dx)d^ < +00 

is satisfied. This is easily obtained from the assumption that sup^£jo,T] -^[(^t) 1)^] < ^ 
since the coefficients are bounded. 

Step 5 Let us identify the limit. Let us call = C{X^) and denote by Q a 
limiting value of the tight sequence , and hy X = {Xt)t>o a process with law Q. 
Because of Step 4, X belongs a.s. to C{[0,T], Mp). We have to show that X satisfies the 
conditions l|4.16|l . (|4.17|l and (|4.18|l . First note that (|4.16|l is straightforward from (|4.25p . 
Then, we show that for any function / in C^{W^), the process m/ defined by lITTTIl IS a 
martingale (the extension to every function in is not hard). We consider < si < ... < 
Sn < s < t, some continuous bounded maps (pi, ...(pn on Mp, and our aim is to prove that, 
if the function ^ from O{[0,T],Mf) into M is defined by 

'I'(z^) = M^s,)...M'^sj{{iyt, f) - {i^s, f) 

~ Is j J^^^^^'^'^^^''^^'^^'^i^^^'^ + fi^) [b{x,V *Uu{x)) - d{x,U * Uuix))]^ Uu{dx)du^ , 

(4.34) 

then 

E{^{X)) = Q. (4.35) 

It follows from H4.9p that 

= E [mx!i)...MXZ) {Mf - Mf '/}) = E {^{X^)) - Ak, (4.36) 
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where is defined by 



fi{x)\b{x, V * X^{x)) \ [ ifiz) - f{x))MK{x, z)dz 
+ r{x)K [Jjfi^) - /(^) - E ^^dy{x))MK{x, z)dz] }xf (dx)rf 



It turns out from H4.H;-i|l that Ak tends to zero as K grows to infinity, and using (|4.25|l . 
that the sequence {\'^{X^)\)k is uniformly integrable, so 



limE {\-^(X 



Eq{\^{X)\) 



(4.37) 



Collecting the previous results allows us to conclude that l|4.,S5|l holds, and thus is a 
martingale. 

We finally have to show that the bracket of is given by H4.18|l . To this end, we first 
check that 

nI = {Xt, ff - {Xo, ff-[ [ 2r{x)f\x)Xs{dx)ds 

Jo Jw 

-2 I {Xs, f) [ fix) [bix, V * Xs{x)) - d{x, U * X,{x))] Xs{dx)ds 

{Xs,f)[ fx{x)r{x)Y,^ij{x)dy{x)Xs{dx)ds (4.38) 



2 J 



is a martingale. This can be done exactly as for M/ , using the semimartingale decom- 
position of (X/^,/)^, given by H4.8p with (^(u) = {i/,/)"^. In another hand, Ito's formula 
implies that 

(Xtjf - (Xojf - (M^t - fds2{XsJ) [ Xs{dx)lr{x)fi{x)Y,^,j{x)dy{x) 

Jo jRd 2 ~i 



ds2{Xs, f) / Xs{dx)f{x) [b{x, V * Xs{x)) - d{x, U * Xs{x))] 
Jw^ 

is a martingale. Comparing this formula with H4.38|l . we obtain (|4.18p . □ 



4.2.2 Rare mutations 

In this case, the mutation step density M is fixed and the mutation rate is decelerated 
proportionally to 

Assumption (H4): 

Mk = M, flK = 

Thus only births without mutation are accelerated. 

As in Section 14.2. 1| we obtain deterministic or random limits, according to the value 
of rye (0,1]. 
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Theorem 4.7 (1) Assume (H), (HI), (H2), (H4) and < t] < 1. Assume also that the 
initial conditions Xq converge in law and for the weak topology on MpiX) as K 
increases, to a finite deterministic measure (^nd that sup^ E{{X^ , 1)^) < +oo. 

Then, for each T > 0, the sequence of processes (X^) belonging to I]i{[0,T], Mp) 
converges (in law) to the unique deterministic function {S,t)t>o £ C{[0,T], Mp) weak 
solution of the deterministic nonlinear integro- differential equation: 

dtUx) = [b{x,V*^t{x))-d{x,U*^t{xmt{x)+ [ M{y,x)fi{y)r{y)^t{y)dy. (4.39) 

(2) Assume now 77 = 1 and that Xq converge in law to Xq. Then, for each T > 0, the 
sequence of processes (X^) converges in law in IJi{[0,T], Mp) to the unique (in law) 
continuous superprocess X G C{\f},T\,Mp), defined by the following conditions: 

sup E{{Xt,lf) < 00, 

ie[o,T] 

and for any f £ C^i^'^), 

Mi = {Xt, f) - {Xo, f) - f [ fx{x)r{x) [ M{x,z)f{z)dzX,{dx)ds 

- I I f{x){b{x,V*Xsix))-d{x,U*Xs{x)))Xs{dx)ds 
Jo Jr'' 

is a continuous martingale with quadratic variation 

{M^)t = 2 f I r{x)f\x)Xs{dx)ds. 
Jo Jwi 

In a SPDE formalism, one can write the last limit as formal solution of the equation 

dtXtix) = [bix,V*Xt{x))-d{x,U*Xt{x))]Xt{x)+ [ M{y,x)ii{y)r{y)Xt{dy)+M, (4.40) 

where M is a random fluctuation term. 

The proof of Theorem 14.71 is similar to proofs of Theorems 14.31 and 14.51 and we leave it 
to the reader. Theorem 14.71 iV\ is illustrated in the simulation of Fig. 121(b). 

5 Rare mutation renormalization of the monomorphic pro- 
cess and adaptive dynamics 

In the previous section, Eqs. (|4.39|l and (|4.4()|l have be obtained at the population growth 
time scale (ecological time scale), under an assumption of rare mutation. Here, we are 
interested in the behavior of the population process at the evolutionary time scale, when 
mutations are extremely rare, as illustrated by the simulation of Fig. ^ (d). We hence 
recover rigorously the stochastic "trait substitution sequence" jump process of adaptive 
dynamics (Metz et al. ^H]) when the initial condition is monomorphic. The biological 
idea behind such a scaling of the population process is that selection has sufficient time 
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between two mutations to eliminate all disadvantaged traits, so that the population remains 
monomorphic on the evolutionary timescale. Then the evolution proceeds by successive 
invasions of mutant traits, replacing the resident trait from which the mutant trait is born, 
occuring on an infinitesimal timescale with respect to the mutation timescale. Our result 
emphasizes how the mutation scaling should compare to the system size (K) in order to 
obtain the correct time scale separation between the "mutant-invasions" (taking place on 
a short time scale) and the mutations (evolutionary time scale). 



5.1 Statement of the result 

We consider here a limit of rare mutations combined with the large population limit of 
Section im (Assumption (HI) and hx = b, dx = d and Mk = M). We assume 
Assumptions (H5): 

(i) ^ik{x) = UKfJ-ix). 

(ii) For any constant C > 0, 

e-^"" « UK « ^/ ^ (5.1) 

A log A 

(thus UK — > when K +oo), or, equivalently, for any C and t > 0, 

logic < <e^^. (5.2) 
Kuk 

(iii) For any x E X , ( b{x, Q and ( i— > d{x, () are positive functions, non-increasing 
and increasing respectively, satisfying 



Vx G X, b{x,0) - d{x,0) > 0, 
(iv) There exists a constant U_> such that U{h) > U_ for any /i G M'^ 



lim inf d(xX) = +oo. (5.3) 



Assumption (H5)-(i) entails the rare mutation asymptotic, and (H5)-(ii) gives the cor- 
rect scaling between the mutation probability and the system size in order to obtain the cor- 
rect time scale separation. Observe that (H5)-(ii) implies that Kuk — > when K — > -|-oo, 
so that the timescale t/KuK, which corresponds to the timescale of mutations (the popu- 
lation size is proportional to K, and each birth event produces a mutant with a probability 
proportional to uk, which gives a total mutation rate in the population proportional to 
Kuk) is a long timescale. Our result gives the behavior of the population process on this 
long timescale. 

Assumptions (H5)-(iii) and (iv) will allow to bound the population size on the muta- 
tion timescale, and to study the behavior of the population when it is monomorphic or 
dimorphic between two (rare) mutation events. Specifically, the monotonicity properties 
of b and d in Assumption (H5)-(iii) ensures, for any x G ^Y, the existence of a unique 
non-trivial stable equilibrium n{x) for the monomorphic logistic equation H4.6p of Exam- 
ple 3 in Section Wl\ Moreover, since b{x^V{Q)u) — d{x,U{0)u) > for any u < n{x) and 
b{x, V{0)u) — d{x, U{0)u) < for any u > n(x), any solution to (|4.6|1 with positive initial 
condition converges to n{x). 
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Concerning the dimorphic logistic equations H4.7|l . an elementary linear analysis of the 
equilibrium (n(x), 0) gives that it is stable if /(y, x) < and unstable if f{y, x) > 0, where 
the function 

f{y, x) = b{y, V{y - x)n{x)) - d{y, U{y - x)n{x)) (5.4) 

is known as the "fitness function" (jlHl^^), which gives a measure of the selective advantage 
of a mutant individual with trait y in a monomorphic population of trait x at equilibrium. 
Similarly, the stability of the equilibrium (0,n(y)) is governed by the sign of f{x,y). 

In order to ensure that, when the invasion of a mutant trait is possible, then this invasion 
will end with the extinction of the resident trait, we will need the following additional 
assumption: 

Assumptions (H6): 

Given any x £ X, Lebesgue almost any y £ X satisfies one of the two following 
conditions: 

(i) either f{y,x) < (so that (n(x),0) is stable), 

(ii) or f{y, x) > 0, f{x, y) < and any solution to (|4.7|1 with initial condition with 
positive coordinates in a given neighborhood of (n(x),0) converges to (0, n(y)). 

In the case of linear logistic density-dependence introduced in Section 12.21 {b{x, C) = 
b{x) and d{xX) = d{x) + a(x)C), the equilibrium monomorphic density n{x) writes 
(6(x) — d{x))/a{x)U{0) and the condition (H6)-(ii) is actually equivalent to f{y,x) > 
and f{x,y) < (see |SI). 

Our convergence result writes 

Theorem 5.1 Assume (H), (HI), (H5) and (H6). Given x G A', 7 > and a sequence 
ofH-valued random variables {'^K)K&'n, such that ^k/K is hounded in "L^ and converges 
in law to 7, consider the process {X^ ,t > 0) of Section^ generated by i4.1[ l with initial 
state -%^x- Then, for any n > 1, e > and < ti < t2 < ■ ■ ■ < tn < 00, and for any 
measurable subsets Ti, . . . , r„ of X , 



hm P(Vi e {1, . . . 3x, G : Supp(X,f/^„^J = {xj 

and\{X^,^^.l)-n{x,)\<e)=P{yie{l,...,n}, Yt^eT^ (5.5) 



where for any v G Mp{X^, Supp(z^) is the support of v and {Yt,t > Qi) is a Markov jump 
process with initial state x generated by 



Aip{x) = {ip{y) - ip{x))g{y,x)M{x,y)dy (5.6) 

where 

5(y,x) =/.(x)6(x,y(0)n(x))n(x)— (5.7) 

Ky^yiy - x)n[x)) 

and denotes the positive part. 

Corollary 5.2 With the same notations and assumptions as in Theorem \5.1\ assuming 
moreover that ^k/K is bounded in for some g > 1, the process {X^j^,^^,t > 0) converges 
when K — > +00, in the sense of the finite dimensional distributions for the topology on 
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Mp{X) induced by the functions v ^ (z^j /) with f bounded and measurable on X, to the 
process {Zt,t > 0) defined by 

* \ n{Yt)6Y, ift>0. 

This corollary follows from the following long time moment estimates. 

Lemma 5.3 Under (H), (HI), (H5)(m) ^EM and (iv), and i!/sup^>i E{{X^ , l)?) < +oo 
for some q > 1, then 

sup supEdX^,!)") < +00, 

K>1 t>0 

and therefore, if q > 1, the family of random variables {(^/^, l)}{_ft'>i, t>o} is uniformly 
integrable. 

Proof of Lemma 15.31 Observe that, if we replace b{x, V * u) hy b and d{x, U * u) by 
g{lL{i^,1.)) where g{C) ■= 'mix£x d{x,() in the indicator functions of each terms of the 
construction H2.,S|1 of the process Xf^, we can stochastically dominate the population size 
{X^,l) by a birth and death Markov process {Z^)t>Q with initial state = {X^ ,1) 
and transition rates 

i2b imm.i/K to {i + l)/K, 

igiJLji) from i/K to [i - l)/K. 

Therefore, it suffices to prove that sup^>o sup^g < +cxd. 

Let us define = P{Zf = k/K). Then 

k>l ^ ' 

^ 5] fc" \lb{k - \)p\-^ + (fc + \)g -kUb^g 



K1 

k>l 
k>l 



k'i+'pi. 



Now, by (H5) (iii) l|5.,Sp . g{Q) — > +oo when a +oo, so there exists oq such that, for any 
a ^ ao, 9{a) > 46. Therefore, for k > Kao, 26((1 + l/Zc)"? - 1) + g{k){{l- l/kf - 1) < 
-26[3 - 2(1 - l/kY - (1 + l/kY], the RHS term being equivalent to -2bq/k. Therefore, 
enlarging uq if necessary and using the fact that (1 + — 1 < 0(2*^ — 1) for any a £ [0, 1], 
we can write 

lKao]-l / i, \<i / h \i 

Iehz-y) < 26(2^ - 1) ( ^ ) - E ( ^ J 

k=l ^ ^ k>\Kao\ ^ ^ 

<2b{2'i -l)al + bqal-bqE{{Z^Y). 
Writing C = (2(2'? — 1) + q)a^/q, this differential inequality solves as 

E{{Z^f) <C+ mX^, m - C)e-''^\ 
which gives the required uniform bound. □ 
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Proof of Corollary 15.21 Let T be a measurable subset of X. Let us prove that 

lim E[{X^j,^^,lr)]=E[n{Yt)lY,er]. (5.8) 

By (H5)-(iii)-|(OI), there exists Co > such that for any C > Co and x G A", d{x,C,) > 
b. Therefore, by (H5)-(iv), for any x G X, n{x) £ [OXo/lL]- Fix e > 0, and write 
[0,Co/fZ!] C uf^^Ii, where p is the integer part of Co/ilL^), and Ij = [(i — l)e, ie[. Define 
Ti = {x £ X : n{x) G Ij} for 1 < i < and apply (|5.5|1 to the sets F n Fi, . . . , F n Fp with 
n = 1, ti = t and the constant e above. Then, by Lemma IO| for some constant C > 
and for sufficiently large K, 

limsupi^[(X,^^„^,lr)] < limsupi^[(X,^^„^^,lr)l(vK ^^^^c] +^ 

K^+co K^+oo 

P 

< X]^™'^^^P^[^^Vi^"if'^rnr,)l(x,5^ ,i)<c] +e 

^ K->+oo ' ^ t/Kui^^ I- 

P 

< + G F n FO + e 

i=l 
P 

< J2 {E[n{Yt)lx,ernr,] + 2eP{Yt G F,)) + e 

i=l 

<E[n{Yt)lY,er]+3e. 

A similar estimate for the liminf ends the proof of IjB.Sp . which implies the convergence 
of one-dimensional laws for the required topology. 

The same method gives easily the required limit when we consider a finite number of 
times ti, . . . ,tn- n 

Observe that the fact that the limit process is not right-continuous prevents the possi- 
bility to obtain a convergence for the Skorohod topology on J]){\}},T], Mp{X)) . 



5.2 Idea of the proof 

Theorem 15.11 can be proved in a similar way as in Champagnat jH]. Let us give an idea 
of the method in order to explain the assumptions, the various parameters appearing in 
Theorem 15. II and the tools involved in the proof. It is based on two ingredients: the study 
of a monomorphic population before the first mutation, and the study of the invasion of a 
single mutant individual in this population. 

1) The first part obtains from large deviation results for the convergence of to 
nt{x)5x when the initial population is monomorphic with trait x, where nt{x) satisfies H4.6p . 
Any positive solution to (|4.H|1 converges to fi{x) when t — > +00, and hence reaches a given 
neighborhood of n{x) in finite time, i.e. on an infinitesimal time scale with respect to 
the mutation time scale. Large deviations theory allows us to show that the exit time of 
(Xf ,1) from this neighborhood behaves as exp(i('C) for some C > (problem of exit 
from a domain, Preidlin and Wentzell ^5)- Thanks to the right part of Assumption H5.2p . 
we can prove that, with high probability, (X/*- , 1) is close to n{x) when the first mutation 
occurs. Therefore, the total mutation rate is close to UKl^{x)Kn{x)h{x, F(0)n(x)) and so. 
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on the mutation time scale t/Kux, the rate of mutation is close to n{x) ^i{x)b{x , V{0)n{x))^ 
which explain the left part of the RHS of (|5.7|1 . This argument can be made rigorous using 
stochastic domination results similar to the one used at the beginning of the proof of 
Lemma and leads to the following result: 

Lemma 5.4 Let n denote the first mutation time and P^k the law of with initial 
state . Given x ^ X and a sequence of integers {zk)k>i such that zk / K ^ z > Q, 

(a) For any e > 0, 

lim P^, (ri>logi^, sup , 1) - > e ) = (5.9) 

K^+oo K°^\^ te[\ogK,T{\ J 

and 

lim Pf^f, (n < \ogK) = 0. 
In particular, under P^^ , -^i^gK ~^ f^{x)Sx and X^_ —>■ n{x)5x in probability. 

(b) For any t > Q, 

lim Pf,^ , ( n > — ^ ) = exp ( - f3{x)t) , 
where (3{x) = ^(x)n(x)6(x, F(0)n(a;)). 




Figure 3: The three steps of the invasion and fixation of a mutant trait y in a monomor- 
phic population with trait x. Plain curves represent the resident and mutant densities 
{X^ , and {X^ , '^{y})i respectively. Dotted curves represent the solution of Eq. I|4.7jl 
with initial state nQ{x) = n{x) and no(y) = £■ 

2) The study of the invasion of a mutant individual with trait y can be divided in three 
steps represented in Fig. El 
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Firstly, the invasion of the mutant (between and ti in Fig. can be defined as the 
growth of the mutant density {X^^ , l{y}) from 1/K (one individual) to a fixed small level 
e {eK individuals). As long as the mutant density is small, the dynamics of the resident 
density {Xl^,l^^y) is close to the one it followed before the mutation, so it is close to 
n{x) with high probability. Therefore, between and ti, the birth and death rates of an 
individual with trait y are close to b{y, V{y — x)n{x)) and d{y, U{y — x)n{x)) respectively. 
Therefore, the number of mutant individuals is close to a binary branching process with the 
parameters above. When K — > +00, the probability that such a branching process reaches 
level eK is close to its survival probability, which writes [f{y, x)]+/b{y, V{y — x)n{x)). This 
gives the second part of the RHS of (|5.7|1 . 

Secondly, once the invasion succeeded (which is possible only if f{y,x) > 0), the dy- 
namics of the densities of traits x and y are close to the solution to the dimorphic logistic 
equation H4.7p with initial state {n{x),e), represented in dotted curves between ti and t2 
in Fig. IHl Because of Assumption (H6), the resident density can be proved to reach level 
e with high probability (at time t2 in Fig. ISJ. 

Finally, a similar argument as in the first step above allows us to prove that the resi- 
dent population density {X^^ , l^x}) follows approximately a binary branching process with 
birth rate b{y,V{x — y)n{y)) and death rate d{y,U{x — y)n{y)). Since f{x,y) < by 
Assumption (H6), this is a sub-critical branching process, and therefore, the resident trait 
X disappears in finite time with high probability. 

We can show, using results on branching processes, that ti and — t2 are of order 
log A', whereas t2 — ti depends only on e. Therefore, the left part of (|5.2|1 ensures that the 
three steps of the invasion are completed before the next mutation, with high probability. 
The previous heuristics can be made rigorous using further comparison results, and leads 
to the following result. 

Lemma 5.5 Assume that the initial population is made of individuals with traits x and 
y satisfying assumption (H6) (i) or (ii). Let 0q denote the first time when the population 
gets monomorphic, andVo the remaining trait. Let {zk)k>i be a sequence of integers such 
that zk / K ^ n{x) . Then, 

hm P^^ (Fo = y) = — (5.10) 

K^+co -t^^+K^v^ ' b{y,V{y - x)n{x)) 

lim Pz^ . , 1 . (Vo = 2;) = 1 - 77 777 , , , (5.11) 

K^+<yo -t^^+K^y^ ' b{y,V{y - x)n{x)) ' 

Vr/>0, lim P?^ , ( > ^ ^ ) = (5-12) 

and ye > 0, Urn P^k^ , 1 5 (K^e"^, 1) - n{Vo)\ < e) = 1, (5.13) 

where f{y,x) has been defined in i5.4\ )- 

Once these lemmas are proved, the proof can be completed by observing that the 
generator A of the process {Yt,t > 0) of Theorem 15. II can be written as 



A^{x)= {ip{y) -^{x))p{x)K{x,dy), (5.14) 
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where has been defined in Lemma JOl and the probability measure k{x, dh) is defined 

by 



This means that the process Y with initial state x can be constructed as follows: let 
{M{k),k = 0,1,2,...) be a Markov chain in X with initial state x and with transition 
kernel K{x,dy), and let {N{t),t > 0) be an independent standard Poisson process. Let also 
(?n)n>i denote the sequence of jump times of the Poisson process N. Then, the process 
> 0) defined by 

Yt:=M(N( [ l3{Ys)ds 



\ \Jo 

is a Markov process with infinitesimal generator (|5.14p (cf. pTO] chapter 6). 

Let Px denote its law, and define (5„)„>i by Tn = J^" P{Ys)ds. Ob serve that any 
jump of the process Y occurs at some time 5„, but that all Sn may not be effective jump 
times for Y, because of the Dirac mass at x appearing in (|5.15p . 

Fix t > 0, X £ X and a measurable subset T of X. Under P^, Si and Ysi are 
independent. Si is an exponential random variable with parameter P{x), and Ys^ has law 
k{x,-). Therefore, for any n > 1, the strong Markov property applied to Y at time 5i 
yields 



PASn < t < Sn+1, Yt G r) 

ft 

^(^)e-/3(x)s 



/ Py{Sn-l <t-S<Sn, Yt_s G r)K{x, dy)ds (5.16) 

and 

Px(0 <t<Si,YteT) = l{,.er}e"^^"^*- (5-17) 
Using the Markov property at time ri and Lemmas 15.41 and 15. 5( we can prove that, 
when we replace Sn by the n-th mutation time of X^jj^^K ^'^'^ ^* ^■^ support of X^j^^^ 
(when it is a singleton) in the LHS of (|5.16p and (|5.17|1 . the same relations hold in the 
limit K — > +00. Therefore, Theorem 15. II is proved for one-dimensional time marginals. A 
similar method generalizes to finite dimensional laws. 
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